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CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] This application claims the benefit of Korean Patent Application No. 2002- 73807, 
filed November 26. 2002, in the Korean Intellectual Property Office, the disclosure of which is 
incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

[0002] The present Invention relates to a data processing system, and more particularly, to a 
method of backing up and recovering data In a data processing system. 

2. Description of the Related Art 

[0003] In a data processing system such as a computer, data recorded in a file form needs to 
be backed-up to protect against a crash of the data processing system or for future 
contingencies, thereby recovering original data. A user can personally back up and recover a 
file, but as the amount of data to be managed is increased, it is difficult for a user to periodically 
make the backup or to back up and recover a file to be selected one by one. 

[0004] Accordingly, recently, various application software has been developed to improve 
convenience for data backup and recovery. The processes of data backup and data recovery 
using such application software will be described hereinbelow. 

[0005] In the data processing system, data recorded in the file form Is electromagnetically 
stored in a storage unit such as a hard disk. As shown in FIG. 3, the process of backing up the 
stored data is as follows. First, a user selects files to be backed up (SI 00), and detemiines 
where to save backup files (SI 01). Then, when a user orders the data backup to begin (SI 02), 
a new project file is created for saving information on the backup files, or an old project file 
created at the last backup process is loaded (S103^S105). While the data backup is performed 
(S106). the information on the backup files is added into the project file, thereby completing the 
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data backup (SI 07). 

[0006] Further, as shown in FIG. 4, the process of recovering the backup files is as follows. 
First, a user loads the project file (S1 10), and selects the backup files to be recovered (S1 11). 
Then, when a user orders the data recovery to begin (S1 12), the backup file is recovered 
(S1 13), thereby completing the data recovery (S1 14). 

[0007] In the conventional method for the data backup, after files (e.g„ a file 1 , a file 2, a file 
3 and a file 4) whose information is saved in the project file are initially backed up (i.e., after a 
first backup process), if there is a changed file (e.g., the file 1 is changed into a file 1.1) among 
the backup files, the information on only the changed file (the file 1 .1) is added into the project 
file when the data backup is subsequently performed (i.e., in a second backup process). 
Therefore, the other backup files which are not changed are not electromagnetically stored in 
the storage unit again at the second backup process. Moreover, such unchanged backup files 
are not treated as files which are logically backed up together with the changed file. That is, the 
project file of the first backup process includes the infomiafion on the file 1 , the file 2, the file 3 
and the file 4, but the project file of the second backup process includes the information on only 
the file 1.1. 

[0008] Thus, in the case of recovering the backup files of a specific backup process (e.g., the 
second backup process), only the files (e.g., the file 1.1) which are backed up at the specific 
backup process are recovered, so that the files related to each other are not all recovered. In 
order to recover the related files, a user must remember the specific backup process and select 
the related files. Hence, in the case that the data backup is repeatedly perfomned and an 
amount of data is managed, the data backup and recovery are not easy. 

SUMMARY OF THE INVENTION 

[0009] Accordingly, it is an aspect of the present invention to provide a method of backing up 
and recovering data by using a baseline. 

[0010] Additional aspects and advantages of the present invenfion will be set forth in part in 
the description that follows and, in part, will be obvious form the description, or may be learned 
by practicing the present invention. 
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[0011] The foregoing and/or other aspects of the present invention are achieved by providing 
a method of bacl<ing up and recovering data in a data processing system, comprising selecting 
files to be backed up, and determining where to save selected files; loading a project file to 
manage backup information on the selected file; building a baseline in the project file; backing 
up the selected files with the baseline, and updating a backup file database (DB) including 
information on storage addresses where the selected files are saved; and completing data 
backup by saving an updated backup file DB in the project file. 

[0012] According to an aspect of the invention, the baseline includes information on time of 
when the data backup is performed and information on the selected files to be backed up. 

[0013] According to an aspect of the invention, in a case that there is a changed or new 
created file in the data processing system after the data backup is performed, the updating the 
backup file DB includes logically linking the storage addresses of unchanged files of the 
selected files. 

[0014] According to an aspect of the invention, the loading the project file includes creating 
the project file to manage the backup information on the selected file, and the updating the 
backup file DB includes creating the backup file DB including the information on the storage 
addresses. 

[0015] According to an aspect of the invention, after the data backup is performed, the 
method further comprises loading the project file; selecting the baseline related to the selected 
files to be recovered among baselines of the project file; and recovering the selected files 
recorded in the backup file DB based on a selected baseline. 

[0016] According to an aspect of the invention, after the data backup is performed, the 
method further comprises loading the project file; selecting the baseline related to the selected 
files to be recovered among baselines of the project file; and recovering the selected files 
recorded in the backup file DB based on a selected baseline. 

[0017] According to an aspect of the invention, after the data backup is perfomned, the 
method further comprises loading the project file; selecting the baseline related to the selected 
files to be recovered among baselines of the project file; and recovering the selected files 
recorded in the backup file DB based on a selected baseline. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0018] These and/or other aspects and advantages of the present invention will become 
apparent and more readily appreciated from the following description of the embodiments, taken 
in conjunction with the accompany drawings of which: 

[00191 FIG. 1 is a flowchart of a method of backing up data according to an embodiment of 
the present invention; 

[0020] FIG. 2 is a flowchart of a method of recovering data according to an embodiment of 
the present invention; and 

[0021] FIGS. 3 and 4 are flowcharts of a conventional method of backing up and recovering 
data, respectively. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0022] Reference will now be made in detail to the embodiments of the present invention, 
examples of which are illustrated in the accompanying drawings, wherein like reference 
numerals refer to like elements throughout. The embodiments are described below in order to 
explain the present invention by referring to the figures. 

[0023] In a data processing system using a method of data backup and recovery according 
to the present invention, data recorded in file fomi is physically or electromagnetically stored in 
a storage unit. The storage unit generally has high speed and large storage capacity, and 
includes a hard disk, an optical disk, a RAID (redundant array of independent disks) system, 
etc. The data processing system uses an operating system such as WINDOWS, etc. 
supporting application software used to back up and recover data according to the present 
invention. The data backup and recovery software includes a driver that logically interacts with 
hardware components of the data processing system. 

[0024] Thus, a user can back up and recover data recorded in file fomi by using the data 
backup and recovery software, and the process of backing up and recovering data will be 
described with reference to accompanying drawings. 

[0025] FIG. 1 is a flowchart of a method of backing up data according to the present 
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invention. As shown therein, a user first selects files to be backed up (S1 1). and detemiines 
where to save backup files selected by a user (S12). Then, when a user orders the data backup 
of selected files to begin (S13), it is detennined whether a project file for saving infomnation on 
the backup files exists (S14). When the project file exists, the project file is loaded (SI 5). 
Conversely, when there is no project file, a new project file is created (S16). According to the 
present invention, because the project file is created once when the data backup is first 
performed in the data processing system, the existing project file is preferably loaded when the 
data backup is performed later on. Then, a baseline is built in the loaded or created project file 
so as to set up information on time of when the data backup is performed and backup 
information on the files related to each other being selected by a user (S17). 

[0026] Preferably, whenever the data backup is performed, the baseline is newly built in the 
project file, being discriminated from the old baseline. That is, the new baseline can be labeled 
"backup time" whenever the data backup is performed, thereby being discriminated from the old 
base line. For example, in the case that the data backup is dated January 1 , 2002, the baseline 
is labeled "20020101", and in the case that the data backup is dated February 1, 2002, the 
baseline is labeled "20020201". However, the baseline may be labeled in different formats as 
necessary. 

[0027] Thereafter, while the selected files are backed up together with the baseline, an index 
of the files related to the files to be backed up and an index of the related files to be backed up 
together are collectively recorded in a backup file DB (database) (SI 9). Herein, if the data 
backup is first performed, it is determined whether the backup file DB exists or not (SI 8). When 
there is no backup file DB, the new backup file DB is newly organized (S20), but when the 
backup file DB exists, the backup file DB is just updated (SI 9). 

[0028] The backup file DB corresponding to each baseline includes the indexes of the 
backup files. Further, the indexes of the backup files receded in the backup file DB are linked to 
storage addresses of the storage unit, respectively. 

[0029] According to the present invention, for example, when the data backup is first 
performed in January 1, 2002, the project file is created, and the baseline of the related files 
(e.g., a file A, a file B, a file C and a file D) labeled "20020101" are saved in the project file, with 
the backup file DB recording every index of the backup files. Thereafter, there is a changed file 
(e.g., the file A is changed into a file AA), and therefore when the related files (e.g., the file AA, 
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the file B, the file C and the file D) are backed up again on February 1 , 2002, the project file is 
loaded and the baseline thereof is set up by "20020201", with the backup file DB being updated 
by every index of the backup files. At this time, the file AA changed from the file A is 
electromagnetically stored in the storage unit, and the storage address thereof is recorded in the 
backup file DB. Conversely, the unchanged files such as the file B, the file C and the file D are 
not electromagnetically stored in the storage unit, and the updated backup file DB for the 
unchanged files is just logically linked to the backup file DB having the baseline of "20020101". 

[0030] In the method of data backup and recovery according to the present invention, the 
unchanged files are not electromagnetically stored in the storage unit again, like the 
conventional method when the data backup is perfomned later on. However, contrary to the 
conventional method, such unchanged backup files are treated as files which are logically 
backed up together with the changed file. That is, the baseline of "20020101" includes the 
information on the file A, the file B, the file C and the file D, and the baseline of "20020201" 
includes the information on the file AA, the file B, the file C and the file D. 

[00311 With this process, when the selected files are all backed up (S21 ), the loaded project 
file is saved, thereby completing the data backup. Herein, the backup file DB may be managed 
being included in the project file, or separately from the project file. 

[0032] The process of recovering the backup files by using the baseline will be described 
with reference to FIG. 2. In order to recover the backup files, a user first loads the project file 
(S30). Then, a user opens the loaded project file and selects the baseline related to the backup 
files to be recovered (S31). Then, when a user orders the data recovery to begin according to a 
selected baseline (S32), the backup files are recovered (S33). At this time, the related files are 
recovered from the storage addresses of the storage unit linked to the backup file DB with 
reference to the indexes of the backup file related to the selected baseline in the backup file DB, 
thereby completing the data recovery (S34). 

[0033] As described above, when the baseline of "20020201" is selected so as to recover the 
backup files backed up in February 1, 2002, not only the changed file (the file AA) physically 
backed up in February 1 , 2002 is recovered but also the unchanged files (the file B, the file C 
and the file D) logically linked to each other in the backup file DB are recovered. Consequently, 
the related files set up in the baseline can be all recovered. 

[0034] According to the present invention, in order to recover all related backup files, a user 
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just selects the baseline of backup time without remembering each backup time of every file. 
Therefore, in the case that the data backup is repeatedly performed and an amount of data is 
managed, the data backup and recovery are easy. Further, in the case that a user periodically 
backs-up data, the data backup is performed, automatically building the baseline in the project 
file, thereby facilitating the data recovery. 

[0035] In the conventional method of the data backup and recovery, only the last changed file 
is backed up, so that the data recovery is limited to the last backup file. However, according to 
the present invention, the baseline is automatically set up when the data backup is performed, 
so that the data recovery is possible in correspondence to every baseline. 

[0036] On the other hand, in order to recover all related backup files, if all related backup files 
are elec tromagnetically stored in correspondence to every baseline, the space of the storage 
unit occupied with the backup files is increased whenever the data backup is performed, thereby 
causing a shortage of a storing space. However, according to the present invention, only the 
changed backup files are electromagnetically stored in the storage unit, and the unchanged 
backup files are just treated as files which are logically stored in the storage unit, thereby 
decreasing the space of the storage unit occupied with the backup files. 

[0037] As described above, the present invention provides a method of backing up and 
recovering data by using a baseline, in which a baseline is built in a project file whenever data 
backup is performed, so that not only the data backup and data recovery are eas y even if the 
data backup is repeatedly perfonmed and an amount of data is managed, but also a space of a 
storage unit occupied with the backup files is efficiently used. 

[0038] The hardware included in the system may include memories, processors, and/or 
Application Specific Integrated Circuits ("ASICs"). Such memory may include a machine- 
readable medium on which is stored a set of instructions (i.e., software) embodying any one, or 
all, of the methodologies described herein. Software can reside, completely or at least partially, 
within this memory and/or within the processor and/or ASICs. For the purposes of this 
specification, the term "machine-readable medium" shall be taken to include any mechanism 
that provides (i.e., stores and/or transmits) information in a form readable by a machine (e.g.. a 
computer). For example, a machine-readable medium includes read only memory ("ROM"), 
random access memory ("RAM"), magnefic disk storage media, optical storage media, flash 
memory devices, electrical, optical, acoustical, or other form of propagated signals (e.g., carrier 
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waves, infrared signals, digital signals, etc.), etc. 

[0039] Although a few ennbodiments of the present invention have been shown and 
described, it will be appreciated by those skilled in the art that changes may be made in these 
embodiments without departing from the principles and spirit of the invention, the scope of 
which is defined in the appended claims and their equivalents. 
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